Fachbereich Informatik, Technische Universitat Kaiserslautern, 
Postfach 3049, 67653 Kaiserslautern, Germany 
zetzscheOcs . uni-kl . de 



in 



Silent Transitions in Automata with Storage 

Georg Zetzsche 
February 18, 2013 



We consider the computational power of silent transitions in one-way au- 
tomata with storage. Specifically, we ask which storage mechanisms admit a 
' transformation of a given automaton into one that accepts the same language 

ry | ■ and reads at least one input symbol in each step. 

We study this question using the model of valence automata. Here, a finite 
automaton is equipped with a storage mechanism that is given by a monoid. 

This work presents generalizations of known results on silent transitions. 
For two classes of monoids, it provides characterizations of those monoids 
pL^ | that allow the removal of A-transitions. Both classes are defined by graph 

products of copies of the bicyclic monoid and the group of integers. The 
first class contains pushdown storages as well as the blind counters while the 
second class contains the blind and the partially blind counters. 



> 

! 1 Introduction 

CO ■ We consider the problem of removing silent transitions from one-way automata with 

various kinds of storage. Specifically, we ask for which kinds of storage the real-time and 
the general version have equal computational power. 

This is an interesting problem for two reasons. First, it has consequences for the time 
and space complexity of the membership problem for these automata. For automata with 
silent transitions, it is not even clear whether the membership problem is decidable. If, 
however, an automaton has no silent transitions, we only have to consider paths that are 
at most as long as the word at hand. In particular, if we can decide whether a sequence 
of storage operations is valid using linear space, we can also solve the membership 
problem (nondeterministically) with a linear space bound. Similarly, if we can decide 
validity of such a sequence in polynomial time, we can solve the membership problem in 
(nondeterministic) polynomial time. 

Second, we can interpret the problem as a question on resource consumption of re- 
stricted machine models: we ask for which storage mechanisms we can process every 
input word by executing only a bounded number of operations per symbol. 
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There is a wide variety of machine models that consist of a finite state control with a 
one-way input and some mechanism to store data, for example (higher order) pushdown 
automata, various kinds of counter automata [12], or off-line Turing machines that can 
only move right on the input tape. 

For some of these models, it is known whether A-transitions can be eliminated. For 
example, the Greibach normal form allows their removal from pushdown automata 
Furthermore, for blind counter automata (i.e., the counters can go below zero and a 
zero-test is only performed in the end), Greibach also has also shown that A-transitions 
can be avoided [12]. However, for partially blind counter automata (i.e., the counters 
cannot go below zero and are only zero-tested in the end) or, equivalently, Petri nets, 
there are languages for which A-transitions are indeed necessary [T2 ^ [T6 | [TT] . 

The aim of this work is to generalize these results and obtain insights into how the 
properties of the storage mechanism influence the computational power of the real-time 
variant. 

In order to study the expressive power of real-time computations in greater generality, 
we use the model of valence automata. For our purposes, a storage mechanism consists 
of a (possibly infinite) set of states and partial transformations operating on them. 
Such a mechanism often works in a way such that a computation is considered valid 
if the composition of the applied transformations is the identity. For example, in a 
pushdown storage, the operations push and pop (for each participating stack symbol) 
and compositions thereof are partial transformations on the set of words over some 
alphabet. In this computation is valid if, in the end, the stack is brought back 

to the initial state, i.e., the identity transformation has been applied. Furthermore, in 
a partially blind counter automaton, a computation is valid if it leaves the counters 
with value zero, i.e., the composition of the applied operations increase and decrease is 
the identity. Therefore, the set of all compositions of the partial transformations forms 
a monoid such that in many cases, a computation is valid if the composition of the 
transformations is the identity map. 

A valence automaton is a finite automaton in which each edge carries, in addition to 
an input word, an element of a monoid. A word is then accepted if there is a computation 
that spells the word and for which the product of the monoid elements is the identity. 
Valence automata have been studied throughout the last decades [5H71fT H [r5 | ll8 y2UIf2"2"] . 

The contribution of this work is threefold. On the one hand, we introduce a class 
of monoids that accommodates, among others, all storage mechanisms for which we 
mentioned previous results on silent transitions. The monoids in this class are graph 
products of copies of the bicyclic monoid and the integers. On the other hand, we present 
two generalizations of those established facts. Our first main result is a characterization 
of those monoids in a certain subclass for which A-transitions can be eliminated. This 
subclass contains, among others, both the monoids corresponding to pushdown storages 
as well as those corresponding to blind multicounter storages. Thus, we obtain a gener- 
alization and unification of two of the three A-removal results above. For those storage 
mechanisms in this subclass for which we can remove A-transitions, there is a simple 
intuitive description. 

The second main result is a characterization of the previous kind for the class of those 
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storage mechanisms that consist of a number of blind counters and a number of partially 
blind counters. Specifically, we show that we can remove A-transitions if and only if there 
is at most one partially blind counter. Again, this generalizes and unifies two of the three 
results above. 

The rest of the paper is organized as follows. In Section [2l we will fix notation and 
define some basic concepts. In Section [3J we state the main results, describe how they 
relate to what is known, and explain key ideas. Sections [H [5l and [6] contain auxiliary 
results needed in Section which presents the proofs of the main results. 

2 Basic Notions 

We assume that the reader has some basic knowledge on formal languages and monoids. 
In this section, we will fix some notation and introduce basic concepts. 

A monoid is a set M together with an associative operation and a neutral element. 
Unless defined otherwise, we will denote the neutral element of a monoid by 1 and its 
operation by juxtaposition. That is, for a monoid M and a, 6 G M, ab G M is their 
product. For a, b G M, we write a C b if there is a c G M such that b = ac. By 1, we 
denote the trivial monoid that consists of just one element. 

We call a monoid commutative if ab = ba for any a, b G M. A subset ]V C M is said 
to be a submonoid of M if 1 G N and a, 6 G A implies ab G A. For a subset A C M, let 
(A) be the intersection of all submonoids N' of M that contain A. That is, (A) is the 
smallest submonoid of M that contains A. (A) is also called the submonoid generated 
by N. In each monoid M, we have the following submonoids: 

H(M) = {a£M\3b£M:ab = ba=l}, 
R(M) = {a G M | 3b G M : ab = 1}, 
L(M) = {a G M | 36 G M : 6a = 1}. 

When using a monoid M as part of a control mechanism, the subset 

J(M) = {a G M | 36, c G M : 6ac = 1} 

will play an important role. By M n , we denote the ra-fold direct product of M, i.e. 
M n = M x • • • x M with n factors. 

Let S C M be a subset. If there is no danger of confusion with the n-fold direct 
product, we write S n for the set of all elements of M that can be written as a product 
of n factors from S. 

Let S be a fixed countable set of abstract symbols, the finite subsets of which are 
called alphabets. For an alphabet X, we will write X* for the set of words over X. The 
empty word is denoted by A G X* . Together with the concatenation as its operation, 
X* is a monoid. We will regard every x G X as an element of X* , namely the word 
consisting of only one occurrence of x. For a symbol x G X and a word u) G X* , let 
be the number of occurrences of x in to. For a subset F C A, let \w\y = J2 x eY \ w \x- 
By \w\, we will refer to the length of w. Given an alphabet X and a monoid M, subsets 
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of X* and X* x M are called languages and transductions, respectively. A family is a 
set of languages that is closed under isomorphism and contains at least one non-trivial 
member. For a subset Y Q X, we define the homomorphism Try '■ X* — > Y* by ny (y) = y 
for y € Y and TTy(x) = A for x £ X \ Y. 

Given an alphabet X, we write X® for the set of maps a : X — > N. Elements of X® 
are called multisets. By way of pointwise addition, written a + /3, X® is a commutative 
monoid (also called the /ree commutative monoid over X). We write for the empty 
multiset, i.e. the one that maps every x € X to £ N. For a 6 X®, let |a| = X^ex ^O^)- 
The Parikh mapping is the mapping \& : S* — >■ X® defined by \P(u/)(;c) = \w\ x for all 
and x € S. 

Let A be a (not necessarily finite) set of symbols and R Q A* x A* . The pair (A, R) is 
called a (monoid) presentation. The smallest congruence of A* containing i? is denoted 
by =r and we will write [w]r for the congruence class of w £ A* . The monoid presented 
by (A, R) is defined as A*/=r, Note that since we did not impose a finiteness restriction 
on A, every monoid has a presentation. Furthermore, for monoids Mi, M2 we can find 
presentations (A\,Ri) and (^2,^2) such that A\ n A2 = 0- We define the /ree product 
Mi * M 2 to be presented by (Ai U A 2 , Ri U E 2 ). Note that Mi * M 2 is well-defined up to 
isomorphism. By way of the injective morphisms [w]^ i-> [w]^^^, w € A* for i = 1, 2, 
we will regard Mi and M2 as subsets of Mi *M2. In analogy to the ra-fold direct product, 
we write M^ n ' for the n-fold free product of M. 

Rational Sets Let M be a monoid. An automaton over M is a tuple A = (Q, M, E, qo,F), 
in which Q is a finite set of states, E is a finite subset of Q x M x Q called the set of 
edges, qo € Q is the initial state, and F C Q is the set of /maZ states. The step relation 
=>A of j4 is a binary relation on Q x M, for which (p, a) =>a (OS b) iff there is an edge 
(p, c, q) such that b = ac. The set generated by A is then 

5(A) = {a G M I 3<j G F : (go, 1) =^ (g, a)}. 

A set C M is called rational if it can be written as R = 5(A) for some automaton 
A over M. The set of rational subsets of M is denoted by RAT(M). Given two subsets 
S,T C M, we define 5T = {st | s G 5, t € T}. Since {1} G RAT(M) and 5T € RAT(M) 
whenever 5, T € RAT(M), this operation makes RAT(M) a monoid itself. 

Let C be a commutative monoid for which we write the composition additively. For 

n G N and c G C, we use nc to denote cH he (n summands). A subset 5 C C is linear 

if there are elements so, • • • , s n such that 5 = {so + Ya=1 a * s * I a * e 1 < i < n}. 
A set 5 C C is called semilinear if it is a finite union of linear sets. By SL(C), we 
denote the set of semilinear subsets of C. It is well-known that RAT(C) = SL(C) for 
commutative C (we will, however, sometimes still use SL(C) to make explicit that the 
sets at hand are semilinear). Moreover, SL(C) is a commutative monoid by way of the 
product (5, T) i-> 5 + T = {s + t \ s € 5, t S T}. It is well-known that the class 
of semilinear subsets of a free commutative monoid is closed under Boolean operations 
(see [S:, 9J). 

In slight abuse of terminology, we will sometimes call a language L semilinear if the 
set *&(L) is semilinear. If there is no danger of confusion, we will write 5® instead of 
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(S) if S is a subset of a commutative monoid C. Note that, if A is regarded as a subset 
of the set of multisets over A, the two meanings of X® coincide. 

Valence Automata A valence automaton over M is an automaton A over X* x M, 
where X is an alphabet. An edge (p, w, m, q) in A is called a X-transition if w = X. A 
is called X-free if it has no A-transitions. The language accepted by A is defined as 

L(A) = {w G X* | (w,l) G S(A)}. 

The class of languages accepted by valence automata and A-free valence automata over 
M is denoted by VA A (M) and VA(M), respectively. 

A finite automaton is a valence automaton over the trivial monoid 1. For a finite 
automaton A = (Q,A* x l,E,qo,F), we also write A = (Q, X, E,qo, F). Languages 
accepted by finite automata are called regular languages. The finite automaton A is 
spelling, if E C Q x A x Q, i.e. every edges carries exactly one letter. Let M and C 
be monoids. A valence transducer over M with output in C is an automaton A over 
X* x M x C, where A is an alphabet. The transduction performed by A is 

T(A) = {(x, c) G A* x C | (x, 1, c) G 5(A)}. 

A valence transducer is called X-free if it is A-free as a valence automaton. We denote 
the class of transductions performed by (A-free) valence transducers over M with output 
in C by VT A (M, C) (VT(M, C)). 

Graphs A graph is a pair T = (V, E) where V is a finite set and E C {S 1 C F | |5| < 2}. 
The elements of F are called vertices and those of 22 are called edges. If {w} G 22 for 
some f G V, then t> is called a looped vertex and otherwise it is unlooped. A subgraph of 
r is a graph (V"', 22') with V' and 22' C 22. Such a subgraph is called induced (by V' ) 
if 22' = {S G 22 | S C V'}, i.e. 22' contains all edges from 22 incident to vertices in V'. 
By r \ {v}, for v G V, we denote the subgraph of T induced by V \ {v}. Given a graph 
r = (V, 22), its underlying loop-free graph is T' = (V, 22') with E' = En{S C V | |5| = 2}. 
For a vertex u G V, the elements of 7V(u) = {w G F [ {v, w} G 22} are called neighbors 
of v. A looped clique is a graph in which 22 = {S'CV r | jS 1 ) <2}. Moreover, a clique is 
a loop-free graph in which any two distinct vertices are adjacent. Finally, an anti- clique 
is a graph with 22 = 0. 

A presentation (A, R) in which A is a finite alphabet is a Thue system. To each 
graph T = (V,E), we associate the Thue system Tr = (Ap,2?r) over the alphabet 
Ar = {a v ,a v \ v G V}. Rr is defined as 

Rr = {(a v a v , X) \ v G V} U {(xy, yx) \ x G {a„, a„}, y G a w }, {v, w} G 22}. 

In particular, we have ( ) G Rr whenever {v} G 22. To simplify notation, the 

congruence =t f is then also denoted by =r- In order to describe the monoids we use 
to model storage mechanisms, we define monoids using graphs. To each graph T, we 
associate the monoid 

Mr = X r /= r . 
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Graph T Monoid Mr Storage mechanism 



Pushdown (with three symbols) 
Three partially blind counters 



Three blind counters 



Pushdown (with two symbols) and two blind counters 



Table 1: Examples of storage mechanisms 

If r consists of one vertex and has no edges, Mr is also denoted as B and we will refer 
to it as the bicyclic monoid. The generators a v and a v are then also written a and a, 
respectively. 

3 Results 

Storage mechanisms as monoids First of all, we will see how pushdown storages and 
(partially) blind counters can be regarded as monoids of the form Mr. See Table [T] for 
a set of examples. It is not hard to see that in the bicyclic monoid B, a word over the 
generators a and a is the identity if and only if in every prefix of the word, there are 
at least as many o's as there are a's and in the whole word, there are as many o's as 
there are a's. Thus, a valence automaton over B is an automaton with one counter that 
cannot go below zero and is zero in the end. Here, the increment operation corresponds 
to a and the decrement corresponds to a. 

Observe that building the direct product means that both storage mechanisms (de- 
scribed by the factors) are available and can be used simultaneously. Thus, valence 
automata over B n are automata with n partially blind counters. Therefore, if V is a 
clique, then Mr = W 1 corresponds to a partially blind multicounter storage. 

Furthermore, the free product of a monoid M with B yields what can be seen as a stack 
of elements of M: a valence automaton over M * B can store a sequence of elements of 
M (separated by a) such that it can only remove the topmost element if it is the identity 
element. The available operations are those available for M (which then operate on the 
topmost entry) and in addition push (represented by a) and pop (represented by a). 
Thus, B * B corresponds to a stack over two symbols. In particular, if T is an anti-clique 
(with at least two vertices), then Mr = B^") represents a pushdown storage. 
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Finally, valence automata over Z n (regarded as a monoid by way of addition) corre- 
spond to automata with n blind counters. Hence, if T is a looped clique, then Mr = Z n 
corresponds to a blind multicounter storage. 

Main results Our class of monoids that generalizes pushdown and blind multicounter 
storages is the class of Mr where in T, any two looped vertices are adjacent and any two 
unlooped vertices are not adjacent. Our first main result is the following. 

Theorem 3.1. Let T be a graph such that 

• between any two looped vertices, there is an edge, and 

• between any two unlooped vertices, there is no edge. 
Then the following assertions are equivalent: 

(1) VA(Mr) = VA A (Mr). 

(2) Every language in VA A (Mr) is context-sensitive. 

(3) The membership problem of each language in VA A (Mr) is in NP. 

(4) Every language in VA A (Mr) is decidable. 

(5) r does not contain • * © • as an induced subgraph. 

Note that this generalizes the facts that in pushdown automata and in blind counter 
automata, A-transitions can be avoided. 

It turns out that the storages that satisfy the equivalent conditions of Theorem 13.11 
(and the hypothesis), are exactly those in the following class. 

Definition 3.2. Let C be the smallest class of monoids such that 1 € C and whenever 
M € C, we also have MxZeC and M*B £ C. 

Thus, C contains those storage types obtained by successively adding blind counters 
and building a stack of elements. For example, we could have a stack each of whose entries 
contains n blind counters. Or we could have an ordinary pushdown and a number of 
blind counters. Or a stack of elements, each of which is a pushdown storage and a blind 
counter, etc. 

Our second main result concerns storages consisting of a number of blind counters and 
a number of partially blind counters. 

Theorem 3.3. Let T be a graph such that between any two distinct vertices, there is an 
edge. Then 

VA(Mr) = VA A (Mr) if and only if r < 1, 
where r is the number of unlooped vertices in T. 

In other words, when you have r partially blind counters and s blind counters, A- 
transitions can be eliminated if and only if r < 1. Note that this generalizes Greibach's 
result that in partially blind multicounter automata, A-transitions are indispensable. 
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Key technical ingredients As a first step, we show that for M G C, all languages 
in VA A (M) are semilinear. This is needed in various situations throughout the proof. 
We prove this using an old result by van Leeuwen [23], which says that languages that 
are algebraic over a class of semilinear languages are semilinear themselves. Thereby, 
the corresponding lemma 14.41 slightly generalizes one of the central components in a 
decidability result by Lohrey and Steinberg on the rational subset membership problem 
for graph groups [19] and provides a simpler proof (relying, however, on van Leeuwen's 
result). 

Second, we use an undecidability result by Lohrey and Steinberg [19] concerning the 
rational subset membership problem for certain graph groups. We deduce that for 
monoids M outside of C (and satisfying the hypothesis of Theorem l3.ip . VA A (M) contains 
undecidable languages. 

Third, in order to prove our claim by induction on the construction of M £ C, we use a 
significantly stronger induction hypothesis: we show that it is not only possible to remove 
A-transitions from valence automata, but also from valence transducers with output in 
a commutative monoid. Here, however, the constructed valence transducer is allowed to 
output a semilinear set in each step. Monoids that admit such a transformation will be 
called strongly X-independent. 

Fourth, we develop a normal form result for rational subsets of monoids in C (see 
section [6]). Such normal form results have been available for monoids described by 
monadic rewriting systems (see, for example, [1]), which was applied by Render and 
Kambites to monoids representing pushdown storages [22] . Under different terms, this 
normal form trick has been used by Bouajjani, Esparza, and Maler [2 J and by Caucal [3] 
to describe rational sets of pushdown operations. However, since the monoids in C allow 
commutation of certain non-trivial elements, a more general technique was necessary 
here. In the case of monadic rewriting systems, one transforms a finite automaton 
according to rewriting rules by gluing in new edges. Here, we glue in automata accepting 
sets that are semilinear by earlier steps in the proof. See Lemma 16.21 for details. 

Fifth, we have three techniques to eliminate A-transitions from valence transducers 
while retaining the output in a commutative monoid. Here, we need one technique to 
show that if M is strongly A- independent, then M x Z is as well. This technique again 
uses the semilinearity of certain sets and a result that provides small preimages for 
morphisms from free commutative monoids to the integers. 

The second technique is to show that B is strongly A- independent. Here, we use 
a construction that allows the postponement of increment operations and the early 
execution of decrement operations. This is then used to show that one can restrict 
oneself to computations in which a sequence of increments, followed by a sequence of 
decrements, will in the end change the counter only by a bounded amount. 

The third technique is to show that if M is strongly A- independent, where M is 
nontrivial, then M * M is as well. Here, the storage consists of a stack of elements of M. 
The construction works by encoding rational sets over M * M as elements on the stack. 
We have to use the semilinearity results again in order to be able to compute the set of 
all possible outputs when elements from two given rational sets cancel each other out 
(in the sense that push operations are followed by pop operations). 
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4 Semilinear Languages 



This section contains semilinearity results that will be needed in later sections. The first 
lemma guarantees small preimages of morphisms from multisets to the integers. This 
will be used to bound the number of necessary operations on a blind counter in order to 
obtain a certain counter value. 

Lemma 4.1. Let tp : X® — )• Z be a morphism. Then for any n G Z, the set ip~ l (n) is 
semilinear. In particular, ker ip is finitely generated. Furthermore, there is a constant 
k G N such that for any fi G X® , there is a v C with fi G u + kerp and \v\ <k - \<p(fj,)\. 

Proof. It is clearly possible to construct a context-free grammar for the language L = 
{w G {a, b}* | \w\ a — \w\b = n}. Using an inverse homomorphism, one can then obtain a 
language K from L with ^(K) = ip~ l (n). Thus, </? _1 (n) is semilinear. 

In order to prove the second claim, we present an algorithm to obtain v from /u, from 
which it will be clear that the size of v is linear in the absolute value of y>(}i). Without 
loss of generality, let (p(fi) > 0. The algorithm operates in two phases. 

In the first phase, we construct a v C \i with piy') > ^(a 4 ) — 171 such that \u\ is 
linear in |y>(//)|, where m = max{|<^(x)| | x G X}. In this phase, we start with v = 
and successively add elements from \i to v until <p>(y) > <p(fJ>) — m - As long as we still 
have ip{y) < ^p(n) — m, it is guaranteed that we find an x G X such that v + x C fi 
and <p(x) > 0. Thus, after at most <p([J,) — m steps, we have y>{v) > pip) — Tn and 
\v\ < tp(fi) — m. 

Since we stopped after we first had ip(y) > <p(fJ,) — m, we also have <p(fJ>) — m< <p{y) < 
tp(n) + to. In the second phase, we successively extend v such that <p>(y) always stays 
within the interval [</?(/i) — m,ip([i) + to]: If <p{y) < V 9 (^)i we can fi n d aniel with 
v + x C fx and ip(x) > and if <p(v) > p(fJ-), we can find ani£l with v + x C /x and 
) < 0. We do this nondeterministically and can therefore assume that no value tp{v) 
occurs more than once: otherwise, we could have left out the summands between the 
two occurrences. 

Hence, the values ip{y) obtained in the course of the second phase are distinct numbers 
in [</?(//) — to, <p(p) + to] . Therefore, there is a computation in which after at most 2m + 1 
steps, we have p{v) = p{^)- Then clearly \v\ < (p(p) — to + 2to + 1. □ 

Another fact used in later sections is that languages in VA(M) are semilinear if M G C. 
This will be used in various constructions, for instance when the effect of computations 
(that make use of M as storage) on the output in a commutative monoid is to be realized 
by a finite automaton. We prove this using a result of van Leeuwen [23]. He showed that 
semilinearity of all languages in a family is inherited by languages that are algebraic over 
this family. A language is called algebraic over a family of languages if it is generated by 
a grammar in which each production allows a non-terminal to be replaced by any word 
from a language in this family. 

Definition 4.2. Let J 7 be a family of languages. An T -grammar is a quadruple G = 
(V, T, P, S) where V and T C V are alphabets and S G V \ T. P is a finite set of pairs 
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(A,M) with A G V \ T and M C V*, M G 7". In this context, a pair (A,M) G P 
will also be denoted by ^4 — )• M. We write x y if x = -wAv and y = uwv for 
some u,v,w G y* and (^4, M) G P with u; G M. The language generated by G is 
L(G) = {w G T* | S =>q u;}. A language L is called algebraic over J- if there is an 
P-grammar G such that L = L(G). 

We will use the following result by van Leeuwen, which appeared in |23| . 

Theorem 4.3 (van Leeuwen). Let P be a family of semilinear languages. Then every 
language that is algebraic over P is also semilinear. 

Note that in [19], a group G is called SLI-group if every language in VA A (G) is semi- 
linear (in different terms, however). Thus, the following recovers the result from |19| . 
that the class of SLI-groups is closed under taking the free product. 

Lemma 4.4. Each L G VA A (M * Mi) is algebraic over VA A (M ) U VA A (Mi). 

Proof. Let L = L(A) for some valence automaton A = (Q,X,Mq * Ml, P, qo, F). We 
assume that E = P U E\ with Ei C Q x X* X M» X Q and F = {<?/}. Let V{ = 
{Kp} q | p, q G Q} be a new alphabet for i = 0, 1. Furthermore, let Apj q be the automaton 
(Q, X U M, P|,p, {q}) with P< = P 4 U {(r, K^ l) , 1, s) | r, s G Q}. Finally, let A' be 
the automaton (Q, X U V U Vi,M * M x , E', q , F) with P' = E U Pq U E[. Then clearly 

L(4)=L(i')nr. 

We will construct an P-grammar G = (V,X,P,S) such that L(G) = L(A). This 
will be shown by proving that for any w £ V* , we have w G L(A') iff S =^q vj. Let 
V = X U Vo U V\ and P consist of the productions Kp\ — > L(Ap*^) for p,q £ Q and 
« G {0, 1}. Finally, let S 1 = K^ qf . By induction on n, one can see that S 1 =>q u) implies 
u; G i(j4')- I n particular, u; G £(£?) implies that w is accepted by A' without using the 
added edges in E' \ E. Thus, L{G) C L(A). 

Now suppose it; is accepted by A', which is witnessed by the edge sequence s = 
uqV\U\ ■ ■ -v n u n , where for some i G {0, 1}, we have uo G E'*, Uj G E[ + , Vj G Pj_lj for 
each j G {1, . . . , n}. 

We shall prove by induction on n that S =$>-q vj. For n = 0, we have w G L{Aq^ qf ) 

and thus either 5 = K q ®] qf =>c w oi S = K q ^] q . Kqllqf =^G Therefore, let n > 1. 
Let : P'* — > X*, -0 : P'* — >■ Mo * M\ be the morphisms with ip((p,x,m,q)) = x and 
tp((p,x,m,q)) = m. Then 99(5) = w and ^(s) = 1. Since ip(uj) G Mj and ?p(vj) G Mi_j, 
the definition of the free product yields that there is a j such that either ip(uj) = 1 or 
i/j(vj) = 1, where j > if uq = A. Let x G P /+ be this word Uj or u,-. Note that since 
ip(x) = 1, we have <£>(x) G L(Ap fc q) for k = i ox k = 1 — i (depending on whether x = Uj or 
x = Vj) and where p and g are the initial and final state of the sequence x, respectively. 
We can obtain a sequence s' from s by replacing x with the edge (p, Kp k q , 1, <?). Clearly, 
s' is a valid sequence in A' and by induction, we have S =>q tp(s'). Since w can be 

obtained from ip(s') by replacing Pp fe J with ip(x) G L(Ap fc g), we have S =3>* G tp(s') =3>g w. 

□ 
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Combining the latter lemma with van Leeuwen's result and a standard argument 
for the preservation of semilinearity when builing the direct product with Z yields the 
following. 

Lemma 4.5. Let M G C. Then for any language L G VA A (M) ; the set ^(L) is semilin- 
ear. 

Proof. Since VA A (1) and VA A (B) contain only context-free languages, the lemma holds 
for M = 1 and M = B. Furthermore, by Lemma 14,41 and Theorem 14. 3( we know that if 
VA A (M) contains only semilinear languages then so does VA A (M * B). Thus, it suffices 
to show that if every language in VA A (M) is semilinear then so is every language in 
VA A (Af x Z). 

Let L = L(A) for a valence automaton A = (Q,X,M x Z,E, qo,F) over M x Z. We 
assume that for each edge (p,x, (m,z),q) G E, we have x G X U {A}. We construct an 
automaton A' as follows. Let A' = (Q, XxE, M, E', go, F), where E' = {{p, (x, e), m, q) | 
e = (p, x, (m,z),q) G E}. Furthermore, let ip : (X x E)® —> Z be the morphism with 
f((x, (p, y, (m, z), q))) = z. Moreover, let it : (XxE)® — > X® be the projection map with 
7r((x,e)) = x. Then by the definition of acceptance, we have ^f(L(A)) = ir(^(L(A')) n 
99 _1 (0)). Since L(A') is semilinear by assumption, the class of semilinear subsets of a free 
commutative monoid is closed under intersection, and 9? -1 (0) is semilinear by Lemma 
14.11 L(A) is also semilinear. □ 

5 Membership Problems 

In this section, we study decidability and complexity of the membership problem for 
valence automata over MT. Specifically, we show in this section that for certain graphs 
r, the class VA A (Mr) contains undecidable languages (Lemma I5.5p . while for every V, 
membership for languages in VA(Mr) is (uniformly) decidable. We present two nonde- 
terministic algorithms, one of them uses linear space and one runs in polynomial time 
(Lemma 15. 4p . 

These results serve two purposes. First, for those graphs T that admit undecidable 
languages in VA A (Mr), it follows that silent transitions are indispensable. Second, if we 
can show that silent transitions can be removed from valence automata over Mr, the 
algorithms also apply to languages in VA A (Mr). 

The algorithms in this section rely on the convergence property of certain reduction 
systems. For more information on reduction systems, see [HE]. A reduction system is 
a pair (S, — >•) in which S is a set and — > is a binary relation on S. (5, — >) is said to be 
noetherian if there is no infinite sequence sq,s\, . . . with Sj — > Sj+i for each i 6 N. We 
write < — > (— >■) for the reflexive, transitive, symmetric (reflexive, transitive) closure of 
— >. (S, — >) has the Church-Rosser property if for any s,f £ S with s <—> t, there is a 
u € S with sAu and t A u. We say that (S, —>) is confluent, if for any s,t,u € S with 
s4t and s A u, there is a v G S with t A v and u A v . A noetherian and confluent 
reduction system is called convergent. Furthermore, (S, — >) is called locally confluent, if 
for any s,t,u G S with s — > t and s — > u, there is a v G S with f 4 d and u — > v. An 
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element s G S is irreducible if there is no t G S 1 with s — > t. We say t G S is a normal 
form of s 6 S 1 if s A f and i is irreducible. It is well-known that a reduction system is 
confluent if and only if it has the Church-Rosser property. Furthermore, a noetherian 
locally confluent reduction system is already confluent. 

One of the steps in our algorithms will be to check, given a word w G X£, whether 
w =r A. Unfortunately, turning the Thue system Tr into a reduction system on words 
will not yield a convergent reduction system as the length-preserving rules allow for 
infinite reduction sequences. Therefore, we will use reduction systems on traces instead. 
For more information on traces, see [3]. 

Let X be an alphabet. An irreflexive symmetric relation / C X x X is called an 
independence relation. To each such relation, the corresponding Thue system Tj = 
(X,Ri) is given as Ri = {(ab,ba) \ (a,b) G /}. If =/ denotes the congruence generated 
by T/, then the monoid T(X, J) = X*/=j is called trace monoid, its elements traces. 
The equivalence class of u G X* is denoted as [u]j and since the words in an equivalence 
class all have the same length, \[u]i\ = \u\ is well-defined. 

In order to efficiently compute using traces, we represent them using dependence 
graphs. Let X be an alphabet and I C X x X an independence relation. To each 
word w G X* we assign a loop-free directed acyclic vertex-labeled graph, its dependence 
graph dep(w). If w = x\ ■ ■ ■ x n , xi G X, 1 < i < n, then dep(u>) = (V,E,£), in which 
E C V x V, has vertex set V = {1, . . . ,n} and (i,j) G E if and only if i < j and 
(xi,Xj) ^ /. Furthermore, each vertex i is labeled with £(i) = x% G X. It is well-known 
that for words u, v G X*, we have n =/ v if and only if dep(n) and dep(u) are isomorphic. 
Thus, we will also write dep(s) for dep(n) if s = [u]i. 

Each (undirected, potentially looped) graph T = (V, E) gives rise to an independence 
relation on X-p, namely 

I = {(x, y)\xe {a v ,a v }, y G {a w , a w }, x^y, {v, w} G E}. (1) 

If I is given by T in this way, we also write =r|T f° r =i an d [u]r\j instead of [u]i. 

In the following, let / be given by T = (V, E) as in ([T]). We will now define a reduction 
relation — > on Y{Xy,I) such that for u, v G X^ 

[u]r = [v]r if and only if [tt]r|T HriT- (2) 

For s,t G T(Xr,I), let s — > t if there are m,«2 G and t> G V such that s = 
[uia v a v U2]r\T and t = [uiU2]r|T- This definition immediately yields ([2]). Since our 
algorithms will represent traces as dependence graphs, we have to restate this relation 
in terms of the latter. It is not hard to see that for s,t G T(X-p,I), s t if and only if 
there are vertices x,y in dep(s), labeled a v and a v , respectively, such that 

1. there is no path from y to x and 

2. there is no vertex lying on a path from x to y 

and dep(i) is obtained from dep(s) by deleting x and y. We will refer to conditions Q] 
and [2] as the subtrace conditions. 
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Figure 1: Possible fragments of the dependence graph of s. 



Lemma 5.1. The reduction system (T(Xr, I), — >) is convergent. 

Proof, ince the system is clearly noetherian, it remains to be shown that (T(Xt,I),—>) 
is locally confluent. Hence, let x,y,x',y' be vertices in dep(s) labeled a v ,a v ,a w ,a w , 
respectively, satisfying the subtrace conditions such that dep(t) is obtained by deleting 
x, y and dep(i') is obtained by deleting x',y'. If {x,y} = {x',y'}, we are done. Further- 
more, if {x,y} n {x',y'} = 0, deleting x,y from dep(i') (or x',y' from dep(t)) yields a 
u G T(Xr,I) with t — > u and t' — > u. Therefore, we assume x = x' and y ^ y' (the case 
x 7^ x', y = y' can be done analogously). This means in particular that v = w. Since 
(a v ,a v ) £ I, we can also assume that there is an edge from y to y' . 

If (a v ,a v ) ^ /, there are edges (x,y) and (x,y') in dep(s) and y violates the second 
subtrace condition of x,y' (see Figure [Ta|) . Hence, we have (a v ,a v ) E /. We claim that 
flipping y and y' constitutes an automorphism of dep(s), meaning dep(i) and dep(i') are 
isomorphic and thus t = t' . The former amounts to showing that each vertex z in dep(s) 
has an edge from (to) y iff z has one from (to) y'. 

If there is an edge from y to z, then by the definition of /, we also have an edge 
between x and z. Obeying the first subtrace condition, it has to be directed from x to 
z: Otherwise, there would be a path from y to x (see Figure [Tb|) . Since y and y' share 
the same label, we also have an edge between y' and z. If this were an edge from z to 
y', z would lie on a path from x = x' to y' (see Figure [Tc]), violating the second subtrace 
condition. Hence, there is an edge from y' to z. 

If there is an edge from z to y, then by the definition of /, we also have an edge 
between x and z. By the second subtrace condition, it has to be directed from z to x: 
Otherwise, z would lie on a path from x to y (see Figure [Tdjl . Since y and y' share the 
same label, we also have an edge between y 1 and z. If this were directed from y' to z, 
then there would be a path from y' to x = x' (see Figure [Te]). violating the first subtrace 
condition. Hence, there is an edge from z to y' . 

If there is no edge between y and z, there is also no edge between y' and z, since y 
and y' have the same label. □ 

By d2|) and since (T(Xr, /),—>) is convergent, we have 



This equivalence is the basis of our algorithms to check for the former condition. 

Lemma 5.2. There is a deterministic polynomial-time algorithm that, given a word 
w € X£, determines whether [w]r = [A]r- 



[w]r = [A]r if and only if [u>]r|T [A]r|T- 



(3) 
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Proof. By ([3]), the condition [w]r = [A]r is equivalent to [A]p|T being the normal form 
of [w]r|T- Therefore, our algorithm computes the normal form of [w]r|T- It does so by 
computing the dependence graph of w and successively deleting pairs of nodes that satisfy 
the subtrace conditions. Finding such a pair can be done in polynomial time and since 
at most \w\/2 deletions are possible, the normal form is obtained after polynomial time. 
In the end, the algorithm checks whether the calculated dependence graph representing 
the normal form is empty. □ 

Lemma 5.3. There is a nondeterministic linear-space algorithm that, given a word 
w € X£, determines whether [w]r = [A]r- 

Proof. Let T = (V,E). By ([3]), we have [w]r = [A]r if and only if w can be reduced 
to the empty word by commuting a v and a w for v, w € E, commuting a v and a v for 
{v} € E and deleting a v a v for v S V. Therefore, we can clearly construct a monotone 
grammar for the set of all w € that have [w]r = [A]r- O 

Lemma 5.4. For each L € VA(Mr), the membership problem can be decided by 

1. a nondeterministic polynomial-time algorithm as well as 

2. a nondeterministic linear-space algorithm. 

In particular, the languages in VA(Mr) are context-sensitive. 

Proof. In order to decide the membership problem for a word w for a language in 
VA(Mr), we can guess a run reading w. Since there are no A-transitions in the au- 
tomaton, such a run has length linear in |to|. For this run, we have to check whether the 
product of the monoid elements on the edges is the identity element of Mr. By lemmas 
I5.2l and l5.3l this can be done in polynomial time or using linear space. Hence, the lemma 
follows. □ 

Lemma 5.5. Let T be a graph whose underlying loop-free graph is a path on four vertices. 
Then VA A (Mr) contains an undecidable language. 

Proof. Let T = (V, E) and T be the graph obtained from T by adding a loop to every 
unlooped vertex. For notational reasons, we assume that the vertex set of T is V = 
{v | v € V}. Lohrey and Steinberg [19j show that there are rational sets R, S C MF 
over positive generators such that given a word w € WIT over positive generators, it 
is undecidable whether 1 € wRS^ 1 . Note that the morphism ip : WIT — > WIT with 
(p(a v ) = ag and (p(a v ) = a# induces an isomorphism between the submonoids generated 
by positive generators and between the submonoids generated by the negative generators. 
Thus, we find rational sets R,SQ WIF over positive generators with <p(R) = R and 
<p(S) = S. 

If w is a word over positive generators in Mr, w = a\ ■ ■ ■ a n , then we let w = a n ■ ■ ■ a\. 
This is well-defined, for if a% ■ ■ ■ a n = b\ ■ ■ ■ b m , for positive generators a±, . . . , a n , b±, . . . , b m 
then <p(a\ ■ ■ ■ a n ) = tp(pi ■ ■ ■ b m ) and thus (p(a n • * * oj) = <p(a\ ■ ■ ■ a n ) _1 = 99(61 • • • 6 m ) _1 = 
^(bm ■ ■ - bi) and therefore a n • • • a\ = b m - ■ ■ b\. Note that ww = 1 for every word w over 
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positive generators. With this definition, the set S = {s \ s G S} is also rational. 
We claim that for a word w G Mr over positive generators, 1 G wRS if and only if 
1 G (p^RS- 1 ^ 

If 1 G (p(w)RS^ 1 , there are r R, s & S with 1 = </?(u/)rJ _1 and thus s = tp(w)r. 
Thus, we can find s £ S and r G i? with 99(5) = (p(w)(p(r). The injectivity of y on words 
over positive generators yields s = wr and thus 1 = wrs. Hence 1 G wRS. 

If 1 G we have 1 = wrs for some r £ R and s E S. This implies 1 = 

</?(u')</7(r)(^(s)~ 1 and since ip(r) G i? and ^(s) -1 G we have 1 G ip(w)RS~ 1 . 

Thus, given a word u; G Mr over positive generators, it is undecidable whether 1 G 
wRS. Now, we construct a valence automaton over Mr that reads a representative 
of a word w and then nondeterministically multiplies an element from R and then an 
element from S. It accepts if and only if 1 G wRS. Therefore, the automaton accepts 
an undecidable language. □ 

6 Rational Sets 

Here, we present a normal form result for rational subsets of monoids in C. The first 
lemma is a simple observation for which we will not provide a proof. 

Lemma 6.1. Let s G M * B and s = m\a ■ ■ ■ m^amavn'-y • • • am' e . Then we have 

• s G J(M * B) if and only if G L(M), m G J(M), m^- G R(M) /or all 1 < i < k, 
!<3<t. 

• s G L(M * B) z/ and on/y if £ = and rn^rn G L(M) for 1 < i < k. 

• s G R(M * B) z/ and on/y if k = and m-, m G R(M) /or 1 < z < 

The following lemma states the normal form result. Note that in a valence automaton 
over M, we can remove all edges labeled with elements outside of J(M). This is due to 
the fact that they cannot be part of a valid computation. In a valence transducer over 
M with output in C, the edges carry elements from X* x M x C, in which M is used 
as a storage. Therefore, of a rational set S C M x C, we will only be interested in the 
part Sn(J(M) x C). 

If A = (Q, M, E,q , F) is an automaton over M and B = (Q', M, E', q' , {q'f}) is an 
automaton over M with only one final state and QCiQ 1 = 0, then the automaton obtained 
by gluing in B between p, q G Q is C = (Q U Q' , M, qo,F), where 

£" = £U£'u{(p,l,^),(g,l,a})}. 

Lemma 6.2. Lei M G C and C be a commutative monoid and S C M x C a rational 
set. Then we have S n (J(M) x C) = (J" =1 UUiRi, in which 

(i) Lie RAT(L(M) x C), 

C/i G RAT(H(M) x C), and 
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(in) Ri G RAT(R(M) x C) 



for 1 < i < n. Moreover, 

S n (L(M) x C) = U LiUi ' ( 4 ) 

l<i<n, leRj 

5n(R(M)xC)= |J 17^. (5) 

l<i<n, leLi 

Proof. We proceed by induction. Since the lemma clearly holds for M = 1, we show 
that if it holds for M, it is also true for MxZ and M*B. Therefore, suppose the lemma 
holds for M. 

Let 5 C (M x Z) x C be a rational subset. We regard S as a subset of M x (Z x C). 
By induction, we have S n (J(M) x (Z x C)) = UiLi LiUiRi with the properties above. 
Since X(M x Z) = X(M) x Z where X is any of the operators J,H,L,R, the sets Li, Ui,Ri 
can serve as the desired decomposition for (M x Z) x C. 

Let S C (M * B) x C be rational. Then there is an alphabet X, a rational language 
L C X*, and a morphism 99 : X* -)■ (M * B) x C with y?(L) = 5. Without loss of 
generality, we assume that X = {x, x} U Y U Z with (^(x) = a, = a, <p(Y) C M, 
</?(Z) C C, where a and a are the two generators of B. Let A be an automaton accepting 
L such that every edge carries exactly one letter. 

As a first step, we will construct an automaton A' that also has ip(L(A')) = S but 
which has for every element of S n (J(M * B) x C) a representative in X* \ X*xX*xX* . 

Let j4 = (Q, X, E, qo,F). For p,q £ Q, the language 

Kp, g = {kz{w) I w e L p>q (A), <p(w) G {1} x C} 

is clearly contained in VA A (M*B) and is therefore semilinear by Lemma l4.51 Thus, we can 
find a finite automaton A' pq such that ^f(L(A' pq )) = ^>(K PA ). Since C is commutative 
and tp(Z) C C, this also means (p(L(A p q )) = ip(K p>q ). The automaton A' is now obtained 
from A by gluing A' p q into ^4 between p and g, for each p, q € Q. Since in A' for each 
path from the initial to the final state, we can find another path that encodes the same 
element of (M*B) x C and is present in A, we have ip(L(A')) = (p(L(A)) = S. However, 
the glued in automata allow us to encode elements of S H (J(Af * B) x C) by words of a 
certain form. Specifically, we claim that 

S n (J(M * B) x C) C <p(L(A') \ X*xX*xX*) C 5. (6) 

Let s £ Sfl (J(Af * B) x C) and w G L(A') be chosen such that (p(w) = s and |u;|x\z 
is minimal. Toward a contradiction, suppose w G J*sJ*sX*. Then w = fxgxh with 
/, h e X*, g £ (Y U Z)*. Since ama ^ J(M * B) for any m G M \ {1}, our assumption 
s G J(M*B) x C implies ^(p) G {1} x C and thus ip(xgx) G {1} x C. By the construction 
of A 1 , however, this means that there is a word v G Z* such that /u/i G and 
Lp(fvh) = tp(w). Since < IHx\z> ^ n ^ s contradicts the choice of w, proving (JOJ) . 



16 



Let A" = (Q", X, E", cfi, {q" f }) be a spelling finite automaton accepting L(A')\X*xX*xX* 

with input alphabet X. Furthermore, for each p, q G Q" , let B p ~\ Bp®q, Bq + ^ be spelling 
automata satisfying 

L(B^) = L q ,, !p (A")n((YUZyx)* 
L(Bj°l) = L p , q (A") n (Y u zy 
L(BM) = L q ^(A")n(x(YUZ)r 

Then we have 

L(A")= |J L(B(-))L(BW)L(B(+)). (7) 
p.^eQ" 

Since 5 n (J(M *B) xC) C <p(L(A")) C S, we will now modify A" so as to accept 
only words whose image lies in S n (J(M *B)xC). This will be achieved by exploiting 
the induction hypothesis for M x C. From B p ~\ we obtain Bp~^ by removing all edges 
with letters inYUZ and then between any two states r, s gluing in an automaton that 
accepts a rational language with image 

<p(L rta (Bj-)) n (Y u Z)*) n (L(M) x C), 

which exists by induction. The automaton -Bq 4 ^ is obtained from B q + ^ in an analogous 
way: we remove edges with letters in Y U Z and between any two states r, s, glue in an 
automaton that accepts a rational language with image 

V (L r , s (B^) n (Y U Z)*) n (R(M) x C). 

By the induction hypothesis, we can assume that the input alphabet of Bp ^ and B q + ^ 
is {x} U y(~) U Z and {x} U I'M u Z, respectively, where <p(Y (-) ) C L(M) x C and 
p(yW) C R(M) x C. Let = ip(L(B p ~^)), K { q +) = <f{L{B { p +) )). Then G 

RAT(L(M * B) x C), K ( q +) G RAT(R(M * B) x C) and 

= <p(L(B^)) n (L(M * B) x C), 
KM = n (R(M * B) x C). 

Finally, the induction hypothesis provides for each p,q G Q" rational sets G 
RAT(L(M) x C), G RAT(H(M) x C), R p ^i G RAT(R(M) x C) such that 

n(p,g) 

^(L(4°))) n (J(M) x C) = |J L^,^,,,^,^. 

i=i 

We claim that for S = S n (J(M * B) x C), we have 

5 = U [ K p~ )l p,w] U p^ [ R P^ K i +) } ■ ( 8 ) 

l<i<n(p,g) 
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Thus, let s G S. By ©, the definition of A", and ((7J), we can write s = with 

W = U\X ■ ■ ■ UkXVXWl ■ ■ ■ XW£ 

and Uj, v, Wj G (YUZ)* for 1 < i < k, 1 < j < £ and uix • • • u^x G L(B P '),»£ L(B^l), 
xwi ■ ■ ■ xvj£ G L(Bq + ^) for some p,g € Q". By Lemma [67T|, the fact that s € J(M*B) x (7 
implies that (p{m) G L(M) x C, (p(v) G J(M) x C, and <p(m,-) G R(M) x C for 1 < % < k, 
1 < i < ^- In particular, there are words Uj G (Y^ - ) U Z)*,Wj G (YW U Z)* such that 

(p(ui) = (f(ui), <p(wj) = ip(wj) and 

u\x ■ ■ • ttfcX G L(B^), xw\ ■ ■ ■ xw£ G L(Bjj +S) ) 

and tp(v) G £p,<j,ifp, g ,i-Rp, g ,j for some 1 < i < n(p,q), proving "C" of (JSj) - 

For the inclusion "5", note that by §6§ and the definition of the sets on the right, the 

right side is contained in S. Moreover, by Lemma 16.11 the right side is also contained in 

J(M*B)xC. This proves ©. 

We will show that ([8]) is the desired decomposition of S. We have already established 

that 

(i) K^L^ G RAT(L(M*B) x C), 

(ii) U p ^i G RAT(H(M) x C) C RAT(H(M * B) x C), and 

(iii) Rp Ati K^ +) G RAT(R(M*B) x C). 

Therefore, it remains to be shown that (jJJ) and © are satisfied. We only prove (jJJ), the 
proof for ([5]) can be done analogously. The inclusion "5" is immediately clear in each 
case. Thus, suppose s G S n (L(M * B) x C). Then there is a word w G L(A") with 
s = v?(u>) and 

tt> = U%X ■ ■ ■ UkXVXWl ■ ■ ■ XW£ 

and Ui,v,Wj G (Y U Z)* for 1 < i < k, 1 < j < £ and u\x • • • UkX G L(B P ),i)E L(Bp°l), 
xw\ ■ ■ ■ xwg G L(Bg + ^) for some p, g G Q". By Lemma [6. 11 the fact that s G R(M*B) x C 
implies <p(ui), <p(v) G L(M) x C for 1 < % < k and £ = 0. In particular, there are words 
Ui G U Z)* such that (f(ui) = (f(ui) and 

u\x ■ ■ ■ UkX G L(B p ~^), \eL(B { q +) ) 

and <p(v) G Lp^^Up^j and 1 G -Rp, g ,i for some 1 < i < n(p, q) by induction. Therefore, 

1 G R Pj q t iKg + ^ and s G i£p Lp,q,iU Pi q } i. This proves the remaining inclusion "C" of (jH). 

□ 

If we regroup the factors in ([8]), we obtain 

S — [^J Kp ' \Lp y q,iUp^q^Rp^q^i\ Kg \ 

1 <i<n(p,q) 

which implies the following corollary. Note, however, that it can also be deduced from 
Lemma 16.21 
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Corollary 6.3. Let M 6 C and S be a rational subset of (M * B) x C. Then there is 
an alphabet X = {x, x} U Y U Z , a morphism (p : X* — > (M *B)xC with tp(x) = a, 
tp(x) = a, <p(Y) C M , (p{Z) C C, a number ti£N, and rational languages 

LiC({YUZ)*xY, JiQ(YUZ)*, Rt Q (x(Y U Z)*)* 

such that 

n 

S n (J(M *M)xC) = {J (p(Li)<p(Ji)<p(Ri). 
i=i 

7 Silent Transitions 

In this section, we use the facts established in earlier sections to prove the main results. 
Lemma 7.1. Let T be a graph such that 

• between any two looped vertices, there is an edge, and 

• between any two unhoped vertices, there is no edge, and 

• r does not contain • * * • as an induced subgraph. 

Then WIT is in C. 

Proof. We proceed by induction and thus assume that M(T \ {x}) S C for any vertex x. 
Let r = (V, E) and write V = LL)U, where L is the set of looped vertices and U is the 
set of unlooped vertices. For every x G L, let v(x) = N{x) n U, i.e. the set of unlooped 
neighbors of x. We write x < y for x,y G L if v(x) C v{y). Clearly, < is a reflexive, 
transitive order on L. 

If there were x, y S L such that v{x) and are incomparable, there would be 
vertices u,v S U with u S \ z/(y) and v S ^(y) \ v{x). Thus, the vertices u,x,y,v 
induce the subgraph • — * — 8 — ■ , contradicting the hypothesis. Hence, < is a total order 
and has a greatest element g € L. 

• If v{g) = U, then g is adjacent to every vertex in T and thus Mr = M(T \ {g}) x Z. 

• If v(g) C U, then there is an isolated vertex u € U \ v{g)- Hence, we have 

Mr^M(r\H) *b. 

□ 

We will prove Theorem O by showing that that VA(M) = VA A (M) for every MeC. 
This will be done using an induction with respect to the definition of C. In order for 
this induction to work, we need to strengthen the induction hypothesis. The latter will 
state that for any M € C and any commutative monoid C, we can transform a valence 
transducer over M with output in C into another one that has no A-transitions but is 
allowed to output a semilinear set of elements in each step. Formally, we will show that 
each M £ C is strongly X-independent. 
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Definition 7.2. Let C be a commutative monoid and T C X* x SL(C) be a transduction. 
Then $(T) C X* x C is denned as 

$(T) = {(w, c) G I* x C 3(w, 5)eT:c65}. 

For a class T of transductions, ^(J 7 ) is the class of all <fr(T) with T € J 7 . 

A monoid M is called strongly X-independent if for any commutative monoid C, we 
have VT A (M,C) = $(VT(M,SL(C))). 

Note that the inclusion $(VT(M, SL(C))) C VT A (M, C) holds for any M and C. Here, 
in order to have equality, it is necessary to grant the A-free transducer the output of 
semilinear sets, since valence transducers without A-transitions and with output in C 
can only output finitely many elements per input word. With A-transitions, however, a 
valence transducer can output an infinite set for one input word. 

By choosing the trivial monoid for C, we can see that for every strongly A-independent 
monoid M, we have VA(M) = VA A (M). Indeed, given a valence automaton A over M, 
add an output of 1 to each edge and transform the resulting valence transducer into a 
A-free one with output in SL(1). The latter can then clearly be turned into a valence 
automaton for the language accepted by A. Thus, we have the following lemma. 

Lemma 7.3. If M is strongly X-independent, then VA(M) = VA A (M). 

Definition 7.4. A rationally labeled valence transducer over M with output in C is an 
automaton over X* x RAT(M x C). For A = {Q,X* x RAT(M x C),E,q ,F), we also 
write A = (Q, X, M, C, E, qo,F). The transduction performed by A is 

T(A) = {(w, c) € X* x C | 3q € F : (go, (A, {1})) (q, (w, S)), (1, c) G S}. 

A is called spelling if E C Q x X x RAT(M x C) x Q, i.e., if it reads exactly one letter 
in each transition. 

The definition of T(A) for rationally labeled valence transducers A means that A be- 
haves as if instead of an edge (p, (w, S),q), S € RAT(M x C), it had an edge (p, w, m, c, q) 
for each (m, c) <G S. Therefore, in slight abuse of terminology, we will also say that 

(xi,mi,ci) (x n ,m n ,c n ) 

qo > qi -> > <?n-i > q n 

is a computation in A when there are edges (tfe_i, (x^, Si),qi) £ E such that (mj, q) € 
for 1 < i < n. 

Lemma 7.5. For each valence transducer A over M with output in C, there is a spelling 
rationally labeled valence transducer A' with T(A') = T(A). 

Proof. Let A = (Q,X,M,C,E,qo,F). We obtain the A-free rationally labeled valence 
transducer A' = (Q,X,M,C,E f ,qo,F) as follows. We introduce one edge (p, (x, S),q) 
for every triple (p,x,q) G Q x X x Q such that S C M x C is the rational set of elements 
spelled by paths in A that start in p, go along a number of A-edges, then pass through 
an edge labeled x and then again go along a number of A-edges and stop in q. Then 
clearly T(A') = T{A). □ 
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Lemma 7.6. B is strongly X-independent. 

Proof. Let T G VT A (B, C). By LemmaESl we can assume that T = T(A) for a rationally 
labeled valence transducer A = (Q, X, B, C, E, go, F) over B with output in C. 

By Lemma 16,21 we can assume that every edge in A has the form (p,x,LR,q), with 
L G RAT({a}® x C) and R G RAT({a}® x C). Furthermore, we can assume that 
edges starting in the initial state qo are of the form (qo,x, R,p) and, analogously, edges 
ending in a final state 5 £ F are of the form (p,x,L,q), p G Q, L G RAT({a}® x 
C) and i? G RAT({a}® x C). Thus, we can construct an equivalent transducer A' = 
(Q r , X, B, C, E',qo, F') each edge of which simulates the i?-part of one edge of A and then 
the L-part of another edge of A. Hence, in A', every edge is of the form (p,x, RL,q) 
with p,q£Q', R£ RAT({a}® x C), and L G RAT({a}® x C). 

Since {a}® x C and {a}® x C are commutative, all such R and L are semilinear sets 
and we can even assume that every edge is of the form (p,x,R®(m,c)L®,q), in which 
(m, c) G B x C and R and L are finite subsets of {a}® x C and {a}® x C, respectively. 

Now the first crucial observation is that if we allow the transducer to apply elements 
of {a}® x C that, in an edge (p, x, R®{m, c)L® , q) traversed earlier, were contained in R, 
we do not increase the set of accepted pairs in X* x C. This is due to the fact that if the 
counter realized by B does not go below zero in this new computation, it will certainly not 
go below zero if we add the value at hand in an earlier step. Thus, any computation in 
the new transducer can be transformed into one in the old transducer. Furthermore, the 
commutativity of C guarantees that the output is invariant under this transformation. 
Analogously, if we allow the transducer to apply elements from {a}® x C, as long as it 
ensures that in some edge (p,x,R®(m,c)L®,q) traversed later, they are contained in L, 
we do not change the accepted set of pairs either. 

Therefore, we construct a rationally labeled transducer A" from A' . In its state, A" 
stores a state of A' and two sets: a finite set R C {a}® x C and a finite set L C {a}® x C . 
R always contains all those elements of {a}® x C that have occurred in sets R so 
far, and L are elements of {a}® x C that still have to be encountered in sets L in 
the future. Then for every edge (p,x,R®(m,c)L®,q) in A', we have an edge labeled 
(x,(R U R)®(m,c)(L U L)®). A" will then add the elements of R to its set R and 
nondeterministically remove some elements of L from L (they can only be removed if 
this is their last occurrence; otherwise, we might need them in L later). The final state 
will then make sure that L is empty and A" has thus only applied elements early that 
would later appear. In the initial state, both sets R and L are empty and then L is filled 
nondeterministically. 

We have constructed A" to have the following property. For every computation 

(xi,_Rf(mi,ci)L®) (x n ,R®{m n ,c n )L®) 

qo > qi ■ ■ ■ q n ~i > q n , 

we have 

#1 C R 2 C ■ ■ ■ C Rn and L\ D L 2 5 • • • 5 L n . 

The essential idea of the proof is that in A", we can accept any pair of X* x C by a 
computation such that the element in the i?®-part and the element in the L®-part in 
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each edge differ in length (as measured by the number of a and a) only by a bounded 
number. If they differ by more than the bound, either some part of the R®-p&rt or some 
part of the L®-part can be postponed or applied earlier, respectively. 

If then we know that these lengths differ only by a bounded number, we will see 
that for each occurring difference (between the lengths), the set of possible outputs is 
semilinear. Thus, we only have to output this semilinear set and add this difference. 

The valence transducer A is obtained from A" as follows. Let e = (p, x, R®(a k a n , c)L®,q) 
be an edge in A". Let Y and Z be alphabets in bijection with R and L, respectively, 
and let (p : (YUZ)® — > B x C be the morphism extending these bijections. Furthermore, 
if k : B — > Z is the morphism with k(o) = 1 and n(a) = —1, let i[) : (Y U Z)® — > Z 
be defined by VKaO = ^^li^if 1 )))- The set Cj = ^(ifOj" 1 (i))) ^ C now contains all 
outputs c\C2 G C such that there are (a*,ci) G R® and (a u ,C2) G L® with t — u = i. 
Moreover, by Lemma |4.1| the set Cj is semilinear. Let 

b = min{— 1, ip(z) + n — k \ z G Z}, 5 = max{l, tp(y) + n — k \ y G y}. (9) 

A has the same set of states as A" . To simulate the edge e, we introduce for each i G N 
with b < i < B the edge 

(p,x,a k+i a n ,cCi,q) if i > 0, (10) 
(p.x.a^-VC^g) if z < 0. (11) 

Initial state and final states remain unaltered. We claim that < 5(T(^4)) = T{A"). By 
the construction, it is clear that $(T(^4)) C T(A"). Now consider a computation in A" 

with steps p — : ■ — > q for edges (p,x,R®(a a n ,c)L® ,q). Define (p, k, tfj, m, M as 

above. Let r = <p(n) and i = (p(f), [i G F®, f G Z® . 
Suppose there is a y in /x such that 

- y) + A; - n + > 0, (12) 

that is, the counter stays above zero until the end of the step, even if we do not add y. 
Then the counter will also stay above zero if we postpone the application of <p(y) until 
the beginning of the next step. By construction, A" allows us to do so. Note that we 
cannot be in the last step of the computation, since this would leave a positive value on 
the counter. Analogously, suppose there is a z in v such that 

- tj}(u - z) + n - k - iJj(ij,) >0, (13) 

that is, when starting from the right (and interpreting a as increment and a as decre- 
ment), the counter does not drop below zero until the beginning of the step, even if we 
do not apply <p(z). Then we can apply ip(z) earlier in the computation. Again, note 
that this cannot happen in the first step, since this would mean the computation starts 
by subtracting from the counter. 

We transform the computation in the following way. Whenever in some step, (|12j) is 
satisfied, we move cp(y) to the right (i.e., we postpone the application of <p(y)). Sym- 
metrically, whenever in some step, (|13j) is fulfilled, we move <p(z) to the left (i.e., we 
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apply (p(z) earlier). We repeat this and since the computation is finite, this process will 
terminate and we are left with a valid equivalent computation in which (|12|) and (|13p do 
not occur. 

The equations (fT2|) and (fT3|) are equivalent to 

+ if>(u) > ip(y)+n-k, 
ip(/j,) + ip(v) < %l){z) J r n — k. 

Since these are not satisfied, we have 

+ < "0(y) + n — k for each y in fx, (14) 

-(/'(/z) + > ip(z) + n — k for each z m. v (15) 

and thus 

b < V>(/x) + V^) < B. 

Note that these inequalities follow from (|15j) and (|14|) . respectively, if 7^ and (J,^ 0. In 
case // = or v = 0, they still hold because then tp(/J,) + tp(v) is < or > 0, respectively, 
and b < and < B. This means, however, that each step has a counterpart in the 
edges CED and (HU) . Therefore, $(T(i)) = T(A"). □ 

Lemma 7.7. Suppose M & C is strongly X-independent. Then M x Z is strongly X- 
independent as well. 

Proof. In order to simplify notation, we write the operation of C with +. Let T G 
VT A (M x Z, C) and let J 4 = (Q,I,Mx Z, C, g , ^) be a transducer for T. By letting 
E' = {(p, x, m, (z, c), q) \ (p, x, (to, z), c, q) G i?}, we get a transducer ^4' = (Q, X, M, Z x 
C, E' , qo, F). Then we have (w,c) G T if and only if there is a (w, (0, c)) G T(^4'). By 
the hypothesis, there is a A-free valence transducer ^4" over M with output in SL(Z x C) 
such that $(T(A")) = T(A'). 

In j4", every edge is of the form (p, x, m, S, q), where S C Z x C is semilinear. Thus, 
we can assume that every edge is of the form (p, x, to, (£, c) + 5®, q), where SCZxC 
is finite. Since Z x C is commutative, we do not change the transduction if we output 
elements s G Z x C that occur in some 5" in a step anywhere else in the computation. 
Therefore, we can transform A" so as to make it guess the set S of all s G Z x C that 
will occur in an S somewhere in the computation. It uses its finite control to guarantee 
that the computation is only accepting if all elements of S actually occur. In every step, 
it allows the output of every element of S® . Thus, in the resulting transducer A'", we 

r, ■ (x,m,(e,c)+S®) 
have that m any computation, the set b in steps p > q does not change 

throughout the computation. 

Our new transducer A has the same set of states as A'" and the edges are defined as 

follows. For the edge (p, x, m, (£, c) + S®,q) in A'", let Y be an alphabet in bijection 

with S and let (p : Y® — >■ Z x C be the morphism extending this bijection. Let k G N 

be the constant provided by Lemma I4TT1 for the map ifi : Y® — > Z, ip(fj-) = tt± {ip{n)). We 

introduce an edge 

(p, x, (to, £ + ip(u)),c + 7T2(<p(v + ker ip)),q) 
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for every v G Y® with \v\ < k ■ B, where B is the maximum over all values \i\ for edges 
(p' , x' , m' , (I, c') + S®, q') in A'" . Initial and final states remain unaltered. Note that by 
Lemma |4.1| the set c + TT2(<p(v + ker %[))) C C is semilinear. Observe that each of these 
edges chooses an element of S®, namely a fj, G v + ker -ip, and adds i + tp{v) = i + ip(fJ-) 
to the Z-component of the storage and outputs c + Tr2(ip(fJ-)). Thus, it simulates a step 
in A'". Therefore, if (w,c) G $(T(i)), then (u>, (0,c)) € <&{T(A'")) and thus (io,c) G T. 

It remains to be shown that (w, (0,c)) G §(T(A"')) implies (w,c) G $(T(1)). There- 
fore, Let 

)+s„) 

qo > qi--- q n -i > q n 

be a computation in A'" that witnesses (w, (0, c)) G <&{T{A"')). Let Sj G S® for 1 < i < n 
and define Y, (p, ip, k, B as above. Let Sj = tp(fJ-i), fJ-i G Y®. Since the computation accepts 
(w, (0, c)), we have + ' ' ' Mn) + + ' ' ' = and for ^ = /ii + • • • + [i n we have 
thus 

\i}>(ji)\ = + ■■■ +/x n )| = |^i + ■■■+41 <n-5. 

Lemma l4.1l now yields ai/C/j with /i£v + ker ^ and |z^| < knB. This means that we 
can write v = v\ + • • • + v n such that \v{\ < kB for 1 < i < n. Since + ker?/>, we 

have Si G + ker if)) and 

*i + V>M + ■ ■ ■ + ?n + ^K) = h + ' ' ' + 4 + ^(/x) = 0. 
Thus, using the edges 

(a;i,(mi,£i+i/>(i/i)),ci+7r2((/j(^i+keri/)))) (a; n ,(m n / n +i/)(^ n )),c„+7r2(¥'(i/n+ker ?/'))) 

go > qi ■ ■ ■ q n -i > q n , 

we have (w,c) G $(T(i)). □ 

Lemma 7.8. If ip : M N is a morphism with </? _1 (l) = {1}, then VT(M, C) C 
VT(iV, C) for any monoid C . 

Proof. Take a valence transducer over M with output in C and replace each edge 
(p, x, m, c, q) by (p, x, (p(m),c, q). This yields a valence transducer over iV that performs 
the same transduction. □ 

Lemma 7.9. Suppose M EC is non-trivial and strongly X-independent. Then M * B is 
strongly X-independent as well. 

Proof. By Lemma 17.51 i n order to show T G <&(VT(M * B,SL(C))) for any given T G 
VT A (M*B,C), we can assume that T = T(A) for a rationally labeled valence transducer 
A over M * B with output in C. Without loss of generality, we can assume that in 
A = (Q,X,M * M,C,E,q ,F), we have E C Q x X x RAT((M * B) x C) x Q and 

= {?/}• 

First, we claim that VT(M * B<»,C) = VT(M * B,C) for any commutative C. In 
fact, since M G C is non-trivial, it contains an element 6 G R(M) such that ¥ ^ V for 
* 7^ J) j ^ N. Let 66 = 1 and let ai,oT, . . . ,a n ,a^ be the generators of the factors 
B in M * B("), respectively. Then the map (p : M * B( n ) — > M * B, with (^(ai) = ob^a, 
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<p(ai) = ab l a, (p{m) = m, for 1 < i < n and m G M, clearly satisfies ip 1 (1) = {1}. 
Thus, by Lemma E3 we have VT(M * l^, C) = VT(M * B, C) and it will suffice to 
show 

T(A) e $(VT(M * B (n) , SL(C))) 

for some n G N. 

By Corollary 16.31 we can assume that for every edge (p,(x,S),q) G E, there is an 
alphabet X' = {x,x} UY II Z, a morphism ip : X'* (M *l)xC with ip(x) = a, 
tp(x) = a, (p(Y) C M, (p(Z) C C, and rational languages 

l c ((y uz)*x)*, jc(yuz)*, c (x(y u zyy (16) 

such that 5" n (J(M * B) x C) = (p(L)<p(J)ip(R). Indeed: for those edges where the 
union provided by Corollary 16.31 ranges over more than one set, we can introduce new 
edges. Without loss of generality, we assume that the alphabets X',Y,Z are the same 
for all edges (p, (x, S),q). If we replace the edge (p, (x, S), q) by (p, (x, <p(LJR)),q), we 
do not change the transduction, since elements outside of J(M * B) cannot occur in a 
product that results in 1. Therefore, we assume that every edge of A is of the form 
(p, (x,tp(LJR)),q) as in (fT6j) . 

In order to be able to denote several appearing rational sets using a pair of states, we 
construct finite automata 

B^ = (Q(-\X',E(-\q ,<b), 

= (Q(°),YuZ,£(°),go,0), 
£(+) = (Q(+U',£(+U,0) 

such that for each edge (p, (x, ip(LJR)),q) G E, we have L = L rjS J = L t ^ u (B^), 
and R = L VjW (BM) for some states r, s G \ t,u G Q^°\ v,w G Because of 

([16]) , we can assume that in these automata there are subsets QH C Q(~), Q(+) C Q(+) 
such that in B^~\ an edge is labeled x if and only if it enters a state in Q(~^ and an 
edge in B^ is labeled x if and only if it leaves a state in Q^ + \ For each r, s G 
t,u £ , v,w E QW, let 

L r , s = p(L ri ,(flH)) ] J tiU = ^L tiM (B( ))), V = #»,»(5 (+) )), 
L rs = {<^(V) | to G (Y U Z)*, wi G L rjS (5 (_) )}, 
£«,,«, = {^M | to G (Y U Z)*, G L VyW {B^)}. 

By (fT6|) . every edge in A is of the form (p, (x, L rtS J tjU Rv,w), q)- 

The essential idea of the construction is to maintain a representation of a set of possibly 
reached configurations. Roughly speaking, we represent a sequence of rational subsets 
of R(M *B) x C by elements of M *M^ n \ Now in order to simulate the multiplication of 
a set of the form L r>s , we have to output a set of elements of C that appear as output 
while canceling out elements on the stack with those in L TjS . Therefore, we will output 
sets of the form 

^v,w,r,s — G C | (1, c) G RvjwEf,s\ • 
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By Lemma |4.5| these sets are semilinear. 

In the course of a computation, we will have to simulate the multiplication of rational 
subsets of M x C. To this end, we will use the hypothesis of M being strongly A- 
independent in the following way. Assume that 

w = x qH u q(°> x q(°) u q« x q(+) 

is an alphabet. Let D = ({q},W,M,C,E' ,q,{q}) be the rationally labeled valence 
transducer over M with output in C with the following edges: 

• for each r, s G Q^ - -', create a loop on g with input (r, s) € W and label L rjS , 

• for each t,u G create a loop on g with input (t,u) G and label J^ u , and 

• for each v,w G Q^ + s create a loop on g with input (v,w) G and label -R^. 

Since M is strongly A- independent, we can transform D into a A-free valence transducer 
D = (Q,X,M,Sl(C),E,q ,F) over M with output in SL(C) such that $(T(D)) = 
T(D). 

As mentioned above, we will encode rational subsets of R(M * B) x C by elements of 
M * B( n ). The monoid structure of M * B( n ) allows us to use the positive generators of 
the n instances of B as stack symbols. Specifically, for every pair of states v,w G Q*- , 
we will have a symbol A° m that represents the set Rv )W . 

First suppose that sets Jt )U do not occur on edges. When an element of R v ^ w is 
completely canceled out by an element of L rjS , then we have to output an element of 
Cv,w,r,s- Therefore, when A° m is on top and we want to simulate the multiplication of 
L r)S , we output the semilinear set C V:Wt r )S and remove A° w . 

By construction, composing an element of R V)W with one of L TyS always yields one 
whose first component is in some iri(Rv tW >), w' G or one outside of J(M * B) x C. 
Therefore, in order to simulate a computation where an element of L TyS cancels out only 
part of an element of R VtW , we have a split operation, which removes a symbol A° w 
from the top and puts A° W ,A^, w in its place, so that the simulation of L r ^ s can then 
cancel out A^, w and output C w / >w>rtS . Note that the set C w > tW>r!S contains only the 
outputs for those compositions where the elements actually cancel out. In particular, 
those compositions that yield elements outside of J(M * B) x C provide no output in 

In order to simulate an element of L r ^ s that cancels out an element in the composition 
of the two topmost rational sets, we need a way to merge two representations of rational 
sets. However, if we would merge two representations of rational subsets of R(M*B) x C 
into one, the resulting representation would not be of the form A° w , since it has to keep 
track of what states have to be visited on the way. Furthermore, the more representations 
we would merge, the more information we would have to maintain. 

Therefore, we will not merge representations of the form A^. Instead, we have 
another kind of symbols: the symbol A* s stands for an element of R(M*B) x C that can 
be canceled out by one of L T)S . Furthermore, the occurrence of such a symbol also implies 
that the corresponding output of the canceling process has already been performed. This 
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means, the symbol A* s is produced by an operation cancel that removes A° w , places 
A* s on top and outputs C v>WjrjS . Since C is commutative, this early output does not 
change the result. The merge operation then consists of removing A* s A* s , and putting 
A* s , in its place. Since we will always be able to assume that a symbol A° w has already 
been turned into a A* s , we can always simulate the application of a set L r s by removing 

Finally, we have to simulate the application of sets Jt jU . To this end, we use an edge 
(p,(t,u),m, S,q) in the transducer D. The state information of D is then stored in 
symbols D p on the stack. Thus, we simulate Jt )U by removing \D p from the stack, using 
S as output, and adding md q on the stack. 

In order to let elements of M that are factors of elements in R v , w , i.e., elements of 
R v , w , interact with sets J r>s , we have two further operations: convert-to and convert- 
from. Convert-to-M removes an element A° w from the stack and instead adds DrnDg 
on the stack and outputs S, where (qo,(v,w),m,S,q) is an edge in D. That is, the 
element represented by A° jU) can be thought of as being handed over to D. Here, □ 
represents the a that was part of Rv, w , but not of R V:W . Thus, convert-from-M initiates 
a subsequence of stack elements that simulate a computation of D. On the other hand, 
convert-from-M will terminate such a subsequence by simulating the multiplication of 
a set of the form L r ^ s . It removes d q , adds m, removes □, adds A* s , and outputs S, 
where (q, (r, s), m, S, qj) is an edge in D and qj is a final state of D. 

Formally, let be the alphabet 

e = {A° V:W ,A; :S ,a q ,a\v,wGQ (+ \ r,se&-\ g eQ} 

and let n = |0|. We let each of the symbols i£0, together with its counterpart x, be 
the generators of one of the instances of B in M * M^ n \ Sometimes, it is necessary to 
apply one of the aforementioned operations not on top of the stack, but one one symbol 
below the top. Therefore, for the operations split, merge, and cancel, we have a deep 
variant, which nondeterministically removes some x € 0, then performs the original 
operation and then puts x back on top. 

Formally, an operation is a (finite) set of elements of (X U {A}) x (M * M^) x SL((7). 
In accordance with the explanation above, we have the following operations: 

. split = {(A,A°^A° y A° >5 {1}) | v,v',we 

• deep-split = {(A, xsx, S) \ x G 0, (A, s, S) € split} 
. merge = {(A,A; r ,A r ', s A« s ,{l}) | r,r',s G Q^} 

• deep-merge = {(A, xsx, S) \ x G 0, (A, s,S) G merge} 

• deep-deep-merge = {(X,xsx, S) \ x G 0, (A, s,S) G deep-merge} 

• convert-to = {(A, A°^DmD 9 , S) \ v,w G Q (+ \ (q , (v,w),m, S,q) G E} 

• convert-from = {(A, □ ? mDA* s , S) \r,s G Q^~\ (q, (r, s),m, S,qf) G E, qf G F} 
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• deep-convert-from = {(X,xsx,S) \ x G 0, (\,s,S) G con vert- from} 

• cancel = {(A, A° w A* s , C v>w ^ s ) \v,w€ Q {+ \ r,s G Q^} 

• deep-cancel = {(A, xsx, S) \ x G 6, (A, s,S) G cancel} 

• deep-deep-cancel = {(A, aisx,S) \ x G 6, (AjS,^) G deep-cancel} 

We will now describe the transducer A in detail. Although A will have A-transitions, 
we will argue later that every element of T(A) can be accepted by A using a computation 
that uses only a bounded number of A-transitions before and after every non-A-transition. 
Thus, it is clearly possible to transform A into an equivalent A-free valence transducer 
over M*BH 

A is obtained from A by removing all edges and then for each edge (p, x, L rjS J tjU R VtW ,q), 
gluing in the automaton 



(A,A« S ,{1» ( x ,a v mn z ,s) ^ (A,A° jto ,{l}) 
— (D W - (17) 

between p and q for every edge (y, (t,u),m, S, z) in D. Furthermore, on every state of 
A (including those in the glued in automata), we add loops labeled with the operations 
defined above. Finally, we add a loop labeled (A, A,* r , {1}) for each r G on the 

initial state and a loop labeled (A, A"^, {1}) for each v G on each final state. 

By the definition, it is clear that &(T(A)) C T(A). On the other hand, we can accept 
every pair (w,c) G T{A) by A in the following way. First we bring a symbol A* r on 
the stack to represent an empty storage. The first simulated edge (p,x, L r)S J t)U R v , w ,q), 
which has s = r, will thus be able to take the edge from state 1 to 2 in[T71 We assume 
that on the stack, there are no symbols of the form A* s except for one representing the 
empty stack. Thus, the monoid element in the configuration is contained in 

{A; m UmUy | v,w G Q (+) , m G M, y G Q}* U {A; r | r G Q H }. 

For each edge (p,x, L rtS Jt tU R V}W ,q) in the computation in A: 

1. Apply a sequence of cancel/deep-cancel, convert-from /deep-convert-from, merge, 
and split/deep-split loops in state 1 to obtain the symbol A' s on top of the stack. 
Note that for this, we need to use a split or deep-split loop at most once, namely 
for the lowest used occurrence of a A°, ,, which might be canceled only partially. 



2. Use the edge (A, A« s , {1}) in (fTT|) . 

3. If necessary, use a split loop in state 2. 

4. If necessary, use a convert-to loop in state 2. 

5. Choose an edge (x, □ y mD z , S) in (fT7|) . 

6. Use the edge (A, A° w , {!}) in JE 
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Note that the only one of these phases which uses an unbounded number of operations 
is the first one. Therefore, we will change the computation by moving the operations of 
this phase to the point where the modified symbols are created. Thereby, we guarantee 
that before the application of (A, A* S ,{1}) we only need a bounded number of steps. 
This is done as follows. In each of the phases [2] through [6j 

(i) After each introduction of a A° w (by a split/deep-split or by adding A° jW directly): 
if this occurrence is eventually canceled (without being split), cancel it now. This 
can be done using deep-cancel or deep-deep-cancel. 

(ii) After each application of a DjmD e : if the corresponding subsequence □m'D z is 
eventually converted (without adding another \D z m"n\ z >), convert it now. This can 
be done using convert-from. 



(iii) Whenever a symbol A* s produced by (i) or (ii) is eventually merged with a sym 



bol below it, merge them now. This can be done using merge, deep-merge, or 
deep-deep-merge. 

If we also obey these rules in the in the first phase, we can assume that any A° 
or DmC^-subsequence that was canceled/converted and then merged with the current 
underlying symbol in the old computation, is now already merged. Therefore, we can 
change the first phase so as to do only a bounded number of operations to obtain A* s 



on top of the stack. Note that obeying rules (i) through (iii) will not yield an unbounded 



number of operations, since in the first phase, there is at most one occurrence of split or 
deep-split. 

In the end, the stack should contain a symbol A°^, v € Q^ + \ to represent the empty 
storage. This can then by removed by the loop labeled (A, A°„, {1}) on the final state. 

Thus, any (w, c) E T(A) can be produced by a computation in A using only a bounded 
number of A-transitions before and after any input symbol. Hence, A can be easily 
transformed into an equivalent valence transducer with no A-transitions. □ 

We are now ready to prove the first main result. 



Proof of Theorem \3.1[ Clearly, (2) and (3) each imply [(4)| Lemma [5. 5 1 shows that |(4)| im- 



plies |(5)| By Lemma f7. 1\ 17.61 17,7} and l7.9} (5) implies that Mr is strongly A-independent. 



By Lemma 17.31 this implies (1) Finally, by Lemma 15.41 1(1)| implies (2) and |(3)[ □ 



We will now prove Theorem 13.31 By Theorem 13. 1\ we already know that when r < 1, 
we have VA(MT) = VA A (Mr). Hence, we only have to show that VA(MT) C VA A (MT) if 
r > 2. Greibach [12] and, independently, Jantzen [16^17] have shown that the language 



L x = {wc n | w € {0, 1}*, n < bin(w)}, 

can be accepted by a partially blind counter machine with two counters, but not without 
A-transitions. Here, bin(u>) denotes the number obtained by interpreting w as a base 2 
representation: 

bin(iol) = 2 • bin(w) + 1, bin(wO) = 2 • bin(to), bin(A) = 0. 
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Since we have to show VA(B r xZ s )C VA A (B r x Z s ) and we know L x G VA A (B r x Z s ), it 
suffices to prove L\ £ VA(B r x We do this by transforming Greibach's proof into a 
general property of languages accepted by valence automata without A-transitions. We 
will then apply this to show that L 1 £ VA(B r x If). 

Definition 7.10. Let M be a monoid. For x,y £ M, write x = y iff x and y have the 
same set of right inverses. For a finite subset S C M and n £ N, let fM,s( n ) be the 
number of equivalence classes of = in S n P R(M). 

The following notion is also used as a tool to prove lower bounds in state complex- 
ity of finite automata [10]. Here, we use it to prove lower bounds on the number of 
configurations that an automaton must be able to reach in order to accept a language 
L. 

Definition 7.11. Let n £ N. An n- fooling set for a language L C 0* is a set F C n x0* 
such that 

• for each (u, v) £ F, we have uv £ L, and 

• for (ui, v\), (u2,V2) £ F such that u\ ^ U2, we have u\V2 £ L or U2V i ^ L. 
The function gi : N — > N is defined as 

9L( n ) = max{\F\ \ F is an n- fooling set for L}. 

Lemma 7.12. Let M be a monoid and L £ VA(Af). Then there is a constant k £ N 
and a finite set S C M such that gL( n ) < k ■ fM,s( n ) f or °W n G N. 

Proof. Let A; be the number of states in the automaton for L and S be the elements ap- 
pearing on edges. Suppose gL(n) > A;-/^ i s , (n) for some n and let F = {(u%, v%), . . . , (u m , t>, 
be an n-fooling set for L, with m > k- fM,s( n )- Since UiVi £ L for 1 < i < m, we have an 
accepting computation for each of these words. Let (qi,Xi) be the configuration reached 
in such a computation after reading Ui, 1 < i < m. Since the automaton has no A- 
transitions, we have Xi £ S n for any 1 < i < m. Moreover, since a final configuration 
is reachable from (qi,Xi), we also have X{ £ R(M). Furthermore, since m > k ■ fM,s{ n )i 
there are indices i ^ j with qi = qj and Xi = xj. This means however, that mvj £ L and 
UjVi £ L, contradicting the fooling set condition. □ 

Lemma 7.13. For L = L\, we have gi{ n ) > 2 n for any n £ N. 

Proof. Let n £ N, and let F consist of all (n, v) such that u £ {0, 1}™ and v = c hm<yU \ 
Then, F is an n-fooling set for L with \F\ > 2 n : we have uv £ L for any G i 7 . 

Furthermore, if u ^ v! for (u, v), (u' , v') £ F, assume bin(ii) < bin(n'). Then uv' £ L. 

□ 

Lemma 7.14. Let M = B r x Z s for r, s £ N and S £ M a finite set. Then f M ,s is 
bounded by a polynomial. 
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Proof. Every x € B can be written uniquely as x = a k a . We define \x\ = t. For y £ Z, 
we have the usual absolute value \y\. Thus, for z € B r X Z s and 2 = (a?i, . . . , x r ,yi . . . , y s ) 
we can define 

\z\ = max{|a;j|, \yj\ | 1 < i < r, 1 < j < s}. 

Let m = max{|x| | x E 5}. Then, for z € 5™ D R(M), we have |z| < m ■ n. Since every 
element in R(M) is of the form (c/ 1 , . . . , a ir , y\,..., y s ), we have 

fM,s(n) ^ l^ n nR(M)| < (m-n + l) r ■ (2 • m ■ n + l) s , 

which is a polynomial in n. □ 

Proof of Theorem\EM If r < 1, Theorem O already implies that VA(Mr) = VA A (Mr). 
If r > 2, we have L x G VA A (Mr), but Lemmas ESI and EH together imply that 
Li £ VA(MT). □ 



Acknowledgements The author would like to thank Nils Erik Flick, Reiner Hiichting, 
Matthias Jantzen, and Klaus Madlener for comments that improved the presentation of 
the paper. 



References 

[1] Ronald V. Book and Friedrich Otto. String-Rewriting Systems. Springer- Verlag 
New York, Inc., 1993. 

[2] Ahmed Bouajjani, Javier Esparza, and Oded Maler. Reachability analysis of push- 
down automata: Application to model-checking. In Antoni W. Mazurkiewicz and 
Jozef Winkowski, editors, CONCUR '97: Concurrency Theory, 8th International 
Conference, Warsaw, Poland, July 1-4, 1997, Proceedings, volume 1243 of Lecture 
Notes in Computer Science, pages 135-150. Springer, 1997. 

[3] Didier Caucal. On infinite transition graphs having a decidable monadic theory. 
Theoretical Computer Science, 290(1):79 - 115, 2003. 

[4] Volker Diekert and Grzegorz Rozenberg, editors. The Book of Traces. World Sci- 
entific, Singapore, 1995. 

[5] Murray Elder, Mark Kambites, and Gretchen Ostheimer. On groups and counter 
automata. International Journal of Algebra and Computation, 18(08):1345-1364, 
2008. 

[6] Gillian Z. Elston and Gretchen Ostheimer. On groups whose word problem is solved 
by a counter automaton. Theoretical Computer Science, 320(23):175-185, 2004. 

[7] Robert H. Gilman. Formal Languages and Infinite Groups, volume 25 of DIM ACS 
Series in Discrete Mathematics and Theoretical Computer Science. 1996. 



31 



[8] Seymour Ginsburg and Edwin H. Spanier. Bounded algol-like languages. Transac- 
tions of the American Mathematical Society, 113(2):333-368, 1964. 

[9] Seymour Ginsburg and Edwin H. Spanier. Semigroups, presburger formulas, and 
languages. Pacific Journal of Mathematics, 16(2):285-296, 1966. 

[10] Ian Glaister and Jeffrey Shallit. A lower bound technique for the size of nondeter- 
ministic finite automata. Information Processing Letters, 59(2):75-77, 1996. 

[11] Sheila A. Greibach. A new normal- form theorem for context-free phrase structure 
grammars. Journal of the ACM, 12(l):42-52, 1965. 

[12] Sheila A. Greibach. Remarks on blind and partially blind one-way multicounter 
machines. Theoretical Computer Science, 7(3):311 - 324, 1978. 

[13] Gerard Huet. Confluent reductions: Abstract properties and applications to term 
rewriting systems. Journal of the ACM, 27(4):797-821, 1980. 

[14] Oscar H. Ibarra, Sartaj K. Sahni, and Chul E. Kim. Finite automata with multi- 
plication. Theoretical Computer Science, 2(3):271 - 294, 1976. 

[15] Masami Ito, Carlos Martin- Vide, and Victor Mitrana. Group weighted finite trans- 
ducers. Acta Informatica, 38:117-129, 2001. 

[16] Matthias Jantzen. Eigenschaften von Petrinetzsprachen. PhD thesis, Universitat 
Hamburg, 1979. 

[17] Matthias Jantzen. Eigenschaften von petrinetzsprachen. Technical Report IFI-HH- 
B-64/79, Institut fiir Informatik, Universitat Hamburg, 1979. 

[18] Mark Kambites. Formal languages and groups as memory. Communications in 
Algebra, 37:193-208, 2009. 

[19] Markus Lohrey and Benjamin Steinberg. The submonoid and rational subset mem- 
bership problems for graph groups. Journal of Algebra, 320(2) :728-755, 2008. 

[20] Victor Mitrana and Ralf Stiebe. Extended finite automata over groups. Discrete 
Applied Mathematics, 108(3):287-300, 2001. 

[21] Elaine Render. Rational Monoid and Semigroup Automata. PhD thesis, University 
of Manchester, 2010. 

[22] Elaine Render and Mark Kambites. Rational subsets of poly cyclic monoids and 
valence automata. Information and Computation, 207(11):1329 - 1339, 2009. 

[23] Jan van Leeuwen. A generalisation of parikh's theorem in formal language theory. 
In J. Loeckx, editor, Automata, Languages and Programming, volume 14 of Lecture 
Notes in Computer Science, pages 17-26. Springer Berlin / Heidelberg, 1974. 



32 



